Relational Reinforcement Learning
نویسنده
چکیده
Reinforcement learning [10] is a subtopic of machine learning that is concerned with software systems that learn to behave through interaction with their environment and receive only feedback on the quality of their current behavior instead of a set of correctly labelled learning examples. Although reinforcement learning algorithms have been studied extensively in a propositional setting, their usefulness in complex problems is limited by their inability to incorporate relational information about the environment [9]. Relational Reinforcement Learning is concerned with reinforcement learning in domains that exhibit structural properties and in which different kinds of related objects exist. These domains are usually characterized by a very large and possibly unbounded number of different possible states and actions. In this kind of environment, most traditional reinforcement learning techniques break down. This thesis discusses the development of a first applicable relational reinforcement learning (or RRL) system [6,7]. This RRL system combines Q-learning with the representational power of relational learning by using relational representations for states and actions and by employing a relational regression algorithm to approximate the Q-values generated through a standard Q-learning algorithm. Due to the use of a more expressive representation language to represent states, actions and Q-functions, the proposed relational reinforcement learning system can be potentially applied to a wider range of learning tasks than conventional reinforcement learning. It also enables the abstraction from or parametrization of specific goals or even of specific learning environments and allows for the exploitation of results from previous learning phases when addressing new (more complex), but related situations. Relational representations also permit the use of structural information or the existence of objects and relations between objects in the description of the resulting policy (through the learned Q-function approximation). The relational regression algorithm used in the RRL system generalizes over learning examples with a continuous target value and makes predictions about the value of unseen state-action pairs, using a relational representation for both the learning examples and the resulting function. The RRL setting places a number of constraints on the regression techniques that can be used. Learning data generated by a Q-learning algorithm is incremental (i.e., the regression algorithm should be able to both use and update its current model) and it supplies the regression algorithm with a moving target function, as the estimates of Q-values of examples are mostly random at the start of learning but become more reliable with more experience. Because of the relational representation used for states and actions, the algorithm cannot treat the set of all examples as a vector space. Although in all practical applications the dimension of the state space will be finite, it will not be known at the start of the learning experiment and may even vary during the experiment (e.g., when the number of objects in the agent’s environment varies). Three incremental relational regression algorithms are developed that can be used in the RRL system. The TG algorithm incrementally builds first order re-
منابع مشابه
Multi-agent Relational Reinforcement Learning
In this paper we study Relational Reinforcement Learning in a multi-agent setting. There is growing evidence in the Reinforcement Learning research community that a relational representation of the state space has many benefits over a propositional one. Complex tasks as planning or information retrieval on the web can be represented more naturally in relational form. Yet, this relational struct...
متن کاملOnline Learning and Exploiting Relational Models in Reinforcement Learning
In recent years, there has been a growing interest in using rich representations such as relational languages for reinforcement learning. However, while expressive languages have many advantages in terms of generalization and reasoning, extending existing approaches to such a relational setting is a non-trivial problem. In this paper, we present a first step towards the online learning and expl...
متن کاملLearning with Whom to Communicate Using Relational Reinforcement Learning
Relational reinforcement learning is a promising direction within reinforcement learning research. It upgrades reinforcement learning techniques by using relational representations for states, actions, and learned value-functions or policies to allow natural representations and abstractions of complex tasks. Multiagent systems are characterized by their relational structure and present a good e...
متن کاملFunction Approximation in Hierarchical Relational Reinforcement Learning
Recently there have been a number of dif ferent approaches developed for hierarchi cal reinforcement learning in propositional setting We propose a hierarchical version of relational reinforcement learning HRRL We describe a value function approximation method inspired by logic programming which is suitable for HRRL
متن کاملRelational Reinforcement Learning for Agents in Worlds with Objects
In reinforcement learning, an agent tries to learn a policy, i.e., how to select an action in a given state of the environment, so that it maximizes the total amount of reward it receives when interacting with the environment. We argue that a relational representation of states is natural and useful when the environment is complex and involves many interrelated objects. Relational reinforcement...
متن کاملMulti-Agent Relational Reinforcement Learning Explorations in Multi-State Coordination Tasks
In this paper we report on using a relational state space in multi-agent reinforcement learning. There is growing evidence in the Reinforcement Learning research community that a relational representation of the state space has many benefits over a propositional one. Complex tasks as planning or information retrieval on the web can be represented more naturally in relational form. Yet, this rel...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2001